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Description 

[00O1 ] The present invention relates to a plasmid vector of Rhodococcus , a new constitutive promoter, an expression 
vector containing said promoter, microorganisms transformed with the expression vector and their use in the production 
5 of proteins. 

[0002] Bacteria of the Rhodococcus genus are of wide interest in the field of the biodegradation and biotransformation 
of organic compounds (Warhurst and Fewson, 1994, Crit. Rev. Biotechnol., 14:29-73). 

[0003] Processes are known, for example, which use strains of Rhodococcus for the selective removal of organic 
sulfur from fossil fuels (U.S. 5,358,870. U.S. 5,132.219. PCT/US92/01868, EP-445896) and for the production of en- 
io zymes involved in the production of acrylamides (Kobayashi et al., 1992, Tends Biotechnol., 10:402-408), carboxylic 
acids. L-aminoacids (W09804733) and enantiomorphs of chiral compounds (U.S. 5.672,504). 

[0004] The main restricting factor in optimizing biocatalysis processes which use these bacteria is however the lack 
of suitable genetic instruments. 

[0005] This term refers to expression vectors in Rhodococcus which: 

15 

are present in cells in multiple copies; 

are steadily maintained inside the cells without costly selective agents (for example antibiotics), which considerably 
influence the economic convenience of an industrial process; and 

contain a strong promoter, i.e. capable of allowing an effective expression of a gene, or a strong constitutive pro- 
20 moter which does not require the use of inductors and Is not susceptible to repressors. 

[0006] In fact, a limit in the removal of organic sulfur from fossil fuels with strains of Rhodococcus which produce the 
sox enzymatic complex, is due to the presence, upstream the corresponding genes, of a promoter greatly inhibited by 
sulfate. 

25 [0007] To overcome this drawback, the genes encoding this enzymatic complex were placed in Rhodococcus vectors 
under the control of constitutive heterologous promoters such as that of the gene for resistance to chloramphenicol of 
Rhodococcus fascians (Piddington, C.S. et al., 1995, Appl. and Env. Microbiol., 61,2,: 468-475) or of the gene sacB 
of B.subtilis (Denis-Larose, C. et al, 1 998, Appl. and Env. Microbiol., 64, 1 1 ,:4363-4367) and of the gene for resistance 
to Kanamycin of E.coli (Serbolisca et al., Appl. Microbiol. Biotechnol. 1999, 52:122-126). The maintenance of the 

30 vectors, however, required the presence of a selective agent in the culture medium and, furthermore, the expression 
of the sox operon under the control of the promoter sac B proved to be very low (Lau. P. et al. 1999 ACS Fuel Chem.: 
32-33). 

[0008] It has now been found that the disadvantages of the known art described above can be overcome by the 
expression vector of the present invention. 
35 [0009] In accordance with this, a first objective of the present invention relates to the cloning plasmid vector pSM843 
stable in Rhodococcus . 

[0010] A further objective of the present invention relates to a new constitutive promoter of Rhodococcus capable 
of directing the expression of a homologous or heterologous gene with a high efficiency and characterized by the 
sequence SEQ. ID. Nr. 2. 

*o [ooi 1] Another objective of the present invention relates to an expression vector in Rhodococcus which comprises 
said constitutive promoter. 

[001 2] A further objective of the present invention relates to a strain of Rhodococcus transformed with said expression 
vector. 

[0013] Yet another objective of the present invention relates to a process for the production of homologous or net- 
45 erologous proteins in Rhodococcus bacteria transformed with said expression vector. 

[0014] Additional objectives of the present invention will appear evident from the following description and examples. 

Brief description of the figures 

so [0015] Figure 1 : shows the restriction map of the 11 kb plasmid pSM841 . 
[0016] Figure 2 : shows the restriction map of the 7.3 kb plasmid pSM843. 
[0017] Figure 3 : shows the restriction map of the E.coli plasmid pSM839. 

[0018] Figure 4 : shows the restriction map of the plasmid pSM846. Figure 5 : shows the restriction map of the plasmid 
PSM847. 

55 [0019] In particular, the expression vector according to the present invention comprises: 

(a) the rep genes, ORF81 and trbA which encode proteins Involved in replication in Rhodococcus ; 

(b) a gene called parA whose product is necessary for maintaining the plasmid in the absence of selective pressure 
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and is characterized by the sequence SEQ ID. Nr. 1; 

(c) a constitutive promoter of Rhodococcus having the sequence SEQ. ID. Nr. 2; 

(d) a multiple cloning site downstream the promoter and 

(e) at least one gene which encodes a genetic marker selected, for example, from genes of the cad operon (SEQ. 
ID. Nr. 3), which provide resistance to cadmium or genes which encode resistance to an antibiotic. 

[0020] The expression vector also contains the replication origin in E.coli and can therefore be used as shuttle vector 
in Rhodococcus and E.coli . 

[0021] The expression vector, indicated hereunder as pSM846, was obtained by: 

(1) construction of the cloning vector pSM843; 

(2) isolation of a constitutive promoter of Rhodococcus ; and 

(3) insertion of said constitutive promoter into the vector pSM843. 

15 [0022] The plasmid vector pSM843 was prepared by reducing the dimensions of the 1 30 kb plasmid of Rhodococcus 
s_p . DS7 containing the sox operon, the genes which confer resistance to cadmium, and those for resistance to arsenic 
(Margarit et al., 1997 and Serbolisca et al„ 1999). This reduction was effected by means of the following strategy: 

(a) search for plasmids deriving from the deletion of the 130 kb plasmid following transformation of a recipient 
20 strain and isolation of a 22 kb plasmid; 

(b) digestion of said plasmid with suitable restriction enzymes, self-ligase and isolation of the 11 kb plasmid 
pSM841; 

(c) characterization of the plasmid obtained in (b) and 

(d) construction of the 7.3 kb plasmid pSM843 containing at least one genetic marker and the genes parA, rep 
25 and trbA respectively necessary for stability and replication in Rhodococcus . 

[0023] The stability of the vectors obtained was controlled after each reduction so as to select only those maintained 
in at least 90% of the cells of the host strain for at least 30-40 generations, In complete medium and in the absence of 
selective pressure. 

30 [0024] Research on constitutive promoters inside the chromosome of a strain of Rhodococcus was effected for the 
construction of the expression vector, using a new method, which is included in the scope of the present invention. 
[0025] This method is based on the observation that strains of Rhodococcus have the capacity of integrating at 
random fragments of foreign DNA in their chromosome, without the necessity for a sequence homology higher than 3 
bp between the donor DNA and that of the host. The integration effectiveness was estimated at about 1 O 2 -! O 3 colonies 

35 per ug of DNA in Rhodococcus transformation experiments. 
[0026] The method consists in: 

(i) transforming a strain of Rhodococcus directly with a gene reporter without its promoter or with a multicopy 
plasmid of E.coli containing said gene and linearized upstream the gene reporter; 
*o (ii) selecting the clones which express said gene, i.e. the clones which have integrated the gene reporter in their 

chromosome, downstream a promoter sequence; 

(iii) digesting the chromosomal DNA of the selected clones with restriction enzymes which cut upstream and down- 
stream the gene; 

(iv) amplifying the DNA obtained in step (iii); and 

*s (v) sequencing the promoter upstream the gene reporter. 

[0027] Gene reporter refers to a fragment of DNA which encodes a product that allows the selection of the clones 
which express it. Examples of gene reporters useful for this purpose can be selected from those which encode resist- 
ance to antibiotics or heavy metals or enzymes such as XylE or the same Sox proteins. 
so [0028] With respect to the techniques currently used, this method is much more efficient and rapid as it does not 
require the preparation of genome banks with fragments of chromosomal DNA upstream the gene reporter. This method 
can be applied to all microorganisms which, tike Rhodococcus . are capable of integrating at random fragments of 
foreign DNA in their chromosome without the necessity for a high sequence homology between the donor DNA and 
that of the host. 

55 [0029] With this method, a constitutive promoter was identified of a gene of Rhodococcus having the sequence SEQ. 
ID Nr. 2. This promoter was inserted into the plasmid vector psM843, obtaining the expression vector pSM846 which 
can be used for the production of proteins of interest in Rhodococcus . 

[0030] The segregational and structural stability of the expression vector pSM846 in strains of Rhodococcus was 
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determined by operating as described in Example 4. The results demonstrated that the vector is maintained in over 
90% of the cellular population even after subsequent culture passages without selective agents, thus showing a high 
segregational stability. 

[0031] Furthermore, analysis of the plasmids isolated from the transformed strains showed that, even after various 
s generations in liquid culture, this plasmid remains structurally stable in different strains of Rhodococcus with from 4 to 
8 copies per cell. 

[0032] The expression vector of the present invention can be used for the expression of genes which encode proteins 
of interest such as, for example, enzymes involved in the selective removal of organic sulfur from fossil fuels (SoxA, 
SoxB, SoxC, SoxD). the production of L-amtnoacids (amidase, aspartase), the production of enantiomorphs of chiral 
io compounds (epoxide hydrolase, ketoesteroreductase), the production of carboxylic acids (nitrilase), etc. 

[0033] The expression vector can be used in various species of Rhodococcus and in phylogenetically similar bacteria 
such as Gordona and Nocardia . 

[0034] The capacity of the promoter of directing the constitutive expression of the gene put under its control was 
verified by positioning downstream said promoter, the sox operon isolated according to what is described by Serbolisca, 
is L., de Ferra, F., Margarit, I., Appl. Microbiol. Biotech nol., 1999, 52:122-126. 

[0035] The results obtained demonstrated that this promoter allows an effective and constitutive expression of the 
SoxA, SoxB and SoxC proteins in the presence of inorganic sulfur in the culture medium. 

[0036] The strains containing the vectors pSM843, pSM846 and pSM847 were deposited at the Centraalbureau Voor 
Schimmel-cultures as Rhodococcus SMV 112, SMV 113 and SMV 114, where they received the respective numbers 
20 CBS 1 02445, 1 02446 and 1 02447. 

[0037] The following were used in the experiments described hereunder: 

the 130 Kb plasmid containing the sox operon which encodes enzymes responsible for the conversion of diben- 
zothiophene (DTB) to 2-hydroxybiphenyl, and the genes which encode resistance to cadmium and arsenic. Said 
25 plasmid was isolated from the strain Rhodococcus sp. DS7 (Margarit, I. et al., 1997, 9 th Proceedings of the Inter- 

national Conference on Coal Science: 1579-1582); 

the plasmid pSM789, incapable of replicating itself in Rhodococcus , obtained by inserting into the plasmid of B 
coli pUC18, the 4549 bp fragment Hindlll-EcorRI which contains the sox operon without promoter isolated from 
Rhodococcus sp. DS7 (Margarit, I. et al., 1997, 9 th Proceedings of the International Conference on Coal Science: 
30 1579-1582). 

[0038] The Rhodococcus sp. DS7 strain was previously classified as Arthrobacter on the basis of microbiological 
test (D'Addario 1996 Proceedings of the Symposium AAA Biotechnology. Shejbal, E. and Ferrara: 139-149). Subse- 
quently, a comparison of the DNA 16 S sequence with that of the data banks indicated that the strain belonged to the 
35 Rhodococcus genus. The most consistent homologies were found with Rhodococcus erithreus and Rhodococcus eryth- 
ropolis . 

the Rhodococcus DS-2 strain, obtained from Rhodococcus sp. DS7, incapable of desulfurizing as it does not have 
the 130 kb plasmid (Margarit, I. et al. t 1997, 9 th Proceedings of the International Conference on Coal Science: 
40 1579-1582). 

[0039] The following examples are illustrative but do not limit the scope of the invention itself. 
EXAMPLE 1 

45 

Construction of the plasmid vector pSM843 

[0040] Electrocompetent cells of Rhodococcus DS-2 (100 uJ) (BIORAD Gene Pulser TM, 400 ohm, 25 uF) were 
transformed with the 1 30 kb plasmid (100 ng). The recombinant clones were selected on plates of LB-agar containing 

so 0.5 mM of CdCl 2 and subsequently plated on minimum medium (10 g/l of KH 2 P0 4 pH 7.4, 2.5 g/l of NH 4 Ct, 0.2 g/l of 
MgCI 2 -6H 2 0, 0.02 g/l of CaCI 2 , 0.01 g/l of FeCl,, 0.005 g/l of MnCI 2 -2H 2 0, 0.003 g/l of 2nCI 2 , 0.0009 g/l of CuCI 2 2H 2 O f 
agarose 0.8%) containing dibenzothiophene (DBT) as sole sulfur source. One of the cadmium-resistant clones proved 
to be incapable of desulfurizing DBT. The plasmid DNA was extracted from this clone, which, after being tested on 
agarose gel at 0.7%, showed dimensions of about 1 00 kb. 

55 [0041 ] Operating as described above, from the transformation of DS-2 with the 1 00 kb plasmid, a 35 kb plasmid was 
obtained which no longer conferred resistance to arsenic, and from this a 22 kb plasmid. 

[0042] The plasmid stability was checked for the three vectors in order to select only those maintained in the host 
strain for at least 30-40 generations in the absence of selective pressure. 
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[0043] The 22 kb plasmid (1 ug) was digested with 4 U of the restriction enzyme Sex Al (Boehringer), in 10 ul of the 
salts supplied by the producer, incubating for 60 minutes at 37°C and 15 minutes at 70°C. The linearized plasmid (2 
ul) was self-ligated in 10 ul of buffer, in the presence of 1 U of T4 DNA ligase, for 16 hours at 16°C. 1 ul of the ligase 
mixture was used to transform DS-2 electrocompetent cells. A 18.7 kb plasmid was isolated from one of the clones 
5 containing plasmids with reduced dimensions. This plasmid was digested with the enzyme EcoRl obtaining a 1 5.8 kb 
vector which was further reduced by digestion with the enzymes Sspl and Dral, which leave truncated ends. 
[0044] The resulting 11 kb plasmid, called pSM841 (figure 1), was sequenced and showed the presence of: 

the cad operon which confers resistance to cadmium and having the sequence SEQ. ID. Nr. 3; 

io . r ep genes, ORF81 and trbA encoding proteins involved in replication described by Denis-Larose, C. et al., 1998, 
Appl. and Env. Microbiol., 64, 11: 4363-4367. A 15 bp sequence was identified between these two genes, which, 
by homology with other sequences, may correspond to the replication origin of the plasmid; and 
a gene parA whose sequence (SEQ ID Nr. 1 ) shows a partial homology with that of genes involved in the partioning 
of the plasmids which takes place during the cellular division. 2 repeated sequences each of 24 bp, were identified 

*s upstream this gene. 

[0045] To confirm the importance of the DNA region containing the parA gene in maintaining pSM841 , an 8.8 kb 
plasmid was constructed without this region. The results demonstrated a reduction of over 50% in the plasmid stability. 
[0046] Once the genes indispensable for maintaining the plasmid pSM841 had been identified, a 7.3 kb vector was 
20 constructed, containing: 

1- a fragment Dral-EcoRI of 2717 bp comprising the cad operon; 

2- a fragment Sspl-Ncol of 271 6 bp comprising the genes trb A and rep A; 

3- a fragment EcoRI-Ncol of 1857 bp comprising the gene par A and a multiple cloning site (MCS). 

25 

[0047] These fragments were isolated from the plasmid pSM841 by means of amplification and then ligated in the 
presence of T4 DNA ligase at 16°C for a night. 

[0048] The resulting plasmid, called pSM843, has the restriction map indicated in figure 2. 
30 EXAMPLE 2 

Isolation of the constitutive promoter 

[0049] The plasmid pSM789 (5 u.g) was digested with 40 Units (U) of the restriction enzyme Hindlll (Boehringer) in 

35 50 til of buffer supplied by the manufacturer. 

[0050] After incubation at 37° C for 60 minutes the denatu ration/extraction of the proteins was effected by adding a 
volume of phenol-chloroform (1:1) and a volume of chloroform:isoamyl alcohol (23:1). The DNA was subsequently 
precipitated by adding 5 ul of a 3 M solution of Na-acetate and 300 uJ of ethanol to the aqueous phase and then 
resuspended in 1 0 ul of H 2 0. In this way, an open linear plasmid was obtained exactly upstream the RBS region of 

40 the first gene of the sox operon (sox A). 

[0051] The linearized DNA (0.2 ug) was used to transform 100 ul of cells of Rhodococcus sp . DS-2 and the trans- 
formants were then selected on minimum medium plates containing 0.04 g/l of DBT and 1% of C 2 H 5 OH as sole sources 
of sulfur and carbon, respectively. 

[0052] 100 clones capable of growing were isolated using DBT. As the plasmid pSM789 was incapable of replicating 
45 itself in Rhodococcus . the transformants obtained (about 1 0 2 per ug of DNA used) had to contain the sox operon inside 
their chromosome, downstream a promoter which allowed its expression. 

[0053] In order to determine whether the integration took place in different points of the chromosome, the chromo- 
somal DNA was extracted from 20 clones (Current Protocols in Molecular Biology - Vol 1 ) and digested with 20 U of 
the enzyme Notl, for which there is only one binding sequence inside the sox operon. The digested DNA was subjected 
so to electrophoresis on agarose gel 0.8%, visualized by colouring with EtBr 0.5% and transferred onto nylon (Nylon 
Membranes, positively charged - Boehringer) by means of Southern Blot (Sambrook, J., Fritsch, E.F., Maniatis, T. 1989 
Molecular cloning: a laboratory manual 2nd edn. Cold Spring Harbor, NY). 

[0054] The DNA was then hybridized with a probe whose nucleotidic sequence corresponded to a fragment of 770 
bp of the sox C gene, using the non-radioactive method provided by the DIG SYSTEM- kit Boehringer. The hybridization 
ss reaction was effected at 68° C. 

[0055] Of the clones tested a single band was observed, with a different molecular weight for each clone, which 

indicated a single integration event of the plasmid pSM789 in different points of the chromosome. 

[0056] In order to verify if these clones were capable of desulfurizing DBT in the presence of inorganic sulfur, they 
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were plated on minimum medium containing both DBT and MgS0 4 -7H 2 0 (0.20 g/l), in parallel with the Rhodococcus 
s£. DS7 strain. After 12 hours, upon exposing the plate to UV rays at 254 nm, a fluorescent halo was observed for all 
the recombinant clones, whereas no halo was visible for the native strain. This indicated that the activity of the isolated 
promoters was not repressed by sulfate. The clone which had the most consistent halo was called SMV110. 

5 

EXAMPLE 3 

Characterization of the promoter contained in SMV110 

io [0057] 1 u.g of chromosomal DNA of the clone SMV110 was digested with 20 U of the enzyme Clal, in 10 ul of buffer 
(Boehringer), at 37°C for 16 hours. After deactivation of the enzymes at 70°C for 15 minutes, 2 uJ of the digested DNA 
was treated with 1 U of the enzyme T4 DNA ligase in 10 ul of buffer (Boehringer), incubating at 16°C for 16 hours. 
[0058] An aliquot (1 ul) of the ligase mixture was used to transform cells of E.coli XL1-Blue made electrocompetent 
(Dower, W.J. et al. 1988 Nucleic Acids Research, 16: 6127-6145). 

is [0059] The transformants were selected on plates of agarized LB medium containing 100 ug/ml of Ampicillin. The 
ptasmid DNA extracted from one of the clones thus obtained was analyzed by restriction analysis. The results indicated 
that the ptasmid consisted of a 4.5 kb fragment, deriving from the chromosomal DNA of DS-2, and the plasmid pSM789. 
The map of this new plasmid, called pSM839, is showed in figure 3. 

[0060] The 4.5 kb fragment was then amplified with the Polymerase Chain Reaction (PCR) technique, (Leung, D. 
20 w., Chen, E., Goeddel, D.V., 1989 Technique- a journal methods in cell and molecular biology, 1, Nr. 1: 11-15), using 
the following pair of oligonucleotides: 



1) 5' CAGTCACGAC GTTGTAAAAC GA 3» (FORWARD) 



2) 5' TGCATTTGTC GTTGTTGAGT 3 • (REVERSE) 

30 

[0061 ] The amplification was carried out in a DNA Thermal Cycler 480 apparatus (Perkin-Elmer Cetus) using 1 00 ul 
of a reaction mixture containing: 5 ng of plasmid DNA, 60 pmoles of the two oligonucleotides, 200 uM of dNTPs (dATP, 
dGTP, oTTP, dCTP) and 1 U of Taq polymerase (Boehringer), in the buffer recommended by the manufacturer. After 
denaturation for 2 minutes at 94°C, the cyclic program was started, which comprises: 1 minute at 98°C, 1 minute at 

35 60°C and 3 minutes at 72°C for a total of 25 cycles, followed by 8 minutes at 72°C (final extension). 

[0062] The amplified 4.5 kb fragment was then sequenced by means of a DNA sequencer ABI 373 (Perkin Elmer). 
The sequence and possible open reading frames were compared with those present in DNA and protein banks using 
the research motor BLAST supplied by NCBI (Altscul, S.F., Madden, Th, Schaffer, A., Zhang, J., Zhang, Z., Miller, W., 
Lipman, D., 1997 Nucleic Acids Research 25: 3398-3402). 

40 [0063] The results showed that the integration of pSM789 in the chromosomal DNA of the DS-2 strain had taken 
place inside a gene which encodes a protein homologous to some endoglucanases and amylases of other microor- 
ganisms. 

[0064] It was therefore deduced that in the clone SMV1 10, the expression of the sox operon was controlled by the 
promoter of said gene. In order to characterize this promoter, the exact transcription starting point of the sox operon 
^5 was determined by primer extension using the 5* RACE System kit (BRL). 

[0065] The results indicated that the start of the transcription of the sox operon took place at a distance of 622 bp 

from 5' of the operon and that the promoter was therefore situated immediately upstream said region. 

[0066] Analysis of the sequence upstream the transcription starting site revealed the presence of the presumed -10 

region. 

so 

EXAMPLE 4 

Construction of the expression vector pSM846 

55 [0067] A 1100 bp fragment containing the constitutive promoter was amplified by PCR starting from the plasmid 
pSM839 using the following oligonucleotides: 
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<i) 5 ' GATCGGCCGGG ATCCACGAGT GTT 3' (forward) 



BamHI 



5 



( i i ) 5 ' ACCCAACACTGA GCTGTTAA CGGCCGGAGC GGCCGATGCA 



Hindlll Hpal Sfil 



TTTTAGGTGA TGCCCGGG 3' (reverse) . 



15 



Nsi 



[0068] The amplified fragment was digested with the enzymes Hindi!! and BamHI, eluted from agarose and inserted 
Into the plasmid pUC18. The resulting plasmid was digested with the enzymes Afllll-Hpal and ligated to pSM843, 
20 previously digested with Afllll and Ncol (which form compatible ends with the former) in the presence of T4 DNA ligase 
at 16°C for a night. A new vector shuttle of E.coli - Rhodococcus , called pSM846, was thus obtained, containing the 
promoter followed by an MCS for the constitutive expression of proteins in Rhodococcus . The strain of Rhodococcus 
DS-2 containing the plasmid pSM846 t whose map is showed in figure 4, was called SMV112. 

25 EXAMPLE 5 

[0069] In order to verify the stability of the plasmid pSM846 in the transformed strains of Rhodococcus after a pro- 
longed period of culture in the absence of selective pressure, three independent clones of the strain SMV112 were 
grown at 30°C, 200 rpm for 16 hours in 100 ml flasks, containing 20 ml of LB medium (Bacto Triptone 10 g/l, yeast 
30 extract 5 g/l, NaCI 10 g/l). 

[0070] The three cultures (0.1 ml) were used to inoculate a further 20 ml of the same mediums, and the new cultures 
were grown at 30°C, 200 rpm for 16 hours. This procedure was repeated a further 3 times, the cellular growth being 
followed as an increase of the optic density measured at 600 nm (O.D. 600). 

[0071] At the end of each growth, aliquots of cultures were removed, suitably diluted and then plated on LB agar 
35 medium. The plates were incubated at 30°C for 16 hours and the colonies were then counted (CFU/ml). In this way it 
was determined that there were at least 35 generations at the end of the experiment. 

[0072] In order to determine the percentage of cells which had maintained the plasmid for at least 35 generations, 
and were therefore resistant to cadmium, 100 single colonies deriving from the plating by dilution, after the 5th growth 
of the three cultures, were placed on LB plates containing CdC^ at a concentration of 0.5 mM. 

40 [0073] It was observed that the plasmid was maintained in over 90% of the cellular population, thus showing a con- 
siderable segregational stability even after subsequent passages in culture in the absence of selective pressure. 
[0074] In order to verify the structural stability of the plasmid pSM846, at the end of the 5th growth passage, the 
plasmid DNA was extracted from an aliquot of the cultures of 12 clones, using the method described in Serbolisca et 
al. (1 999). The DNA obtained were digested with various restriction enzymes for which there are single or double sites 

45 in the plasmid pSM846 and analyzed by means of electrophoresis on agarose gel for 2 hours at 90 V, 90 mA; the gels 
were coloured with Ethidium Bromide 1 mg/ml. The visualization of the gel on a UV light trans-illuminator showed the 
expected bands according to the restriction map of the plasmid, demonstrating that pSM846 was structurally stable in 
Rhodococcus for several generations in the absence of selective pressure. 

so EXAMPLE 6 

Determination of the number of copies of pSM846 per cell 

[0075] The average number of copies of the plasmid pSM846 per cell was estimated by means of quantitative PCR 
55 on preparations of total DNA from cells of Rhodococcus DS-2 containing said plasmid. The method is based on the 
determination of the quantity of DNA obtained by amplification of a fragment of the plasmid. This quantity is directly 
correlated to the concentration of the plasmid itself, used as a mould in the amplification reaction. 
[0076] The total DNA was prepared according to the method illustrated in Current Protocols in Molecular Biology - 
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Ausubel et al., Ed. Wiley Interscience Section 2.4.1 with the following modification: to obtain a complete cellular lysis 
the centrifuged cells from 1.5 ml of culture were resuspended in 567 microliters of TE (TrisHCI 10 mM. EDTA 1 mM. 
pH 8) containing lysozyme 50 ug/ml and were incubated at 37°C for 20 minutes before adding SDS and proteinase K 
according to protocol. 

[0077] After a treatment of 1 0 minutes with RNasi, the concentration of total DNA was estimated on agarose gel and 
the samples were diluted to 100 pg of DNA per microliter. 

[0078] PCR reactions were effected on the DNA thus obtained, using two oligonucleotides (primers) which appear 
inside the operon for resistance to cadmium and having the following sequence: 



5' TGGCCCGGCC GGAATTGATG GAC 3' (primer Cd4) 



and 



- 5' GCCGACGGCC GCGAT CGTG A TCAG 3' (primer Cdl7) . 

[0079] The standardization was effected by means of a second PCR reaction on the portion of chromosomal DNA 
encoding RNA 16S of Rhodococcus DS7 using oligonucleotides specific for this sequence. 

[0080] The number of copies of plasmid was estimated on both the strain containing the plasmid pSM846 and on a 
strain containing a single copy of the genes for resistance to cadmium integrated in the chromosome and on the strain 
DS7. 

[0081] The amplification reactions were carried out according to the instructions of the Syber Green kit of Perkin 
Elmer-AB. 

[0082] The amplified DNA was marked with the Syber Green fluorescent marker and quantified with a PE Applied 
Biosystems GeneAmp 5700 instrument. Each sample was triply amplified and the fluorescence values were compared 
with calibration curves included in each experiment for each pair of primers. The quantity of DNA obtained by amplifying 
with the primers specific for the plasmid was then corrected for the quantity of chromosomal DNA amplified with the 
primers specific for the RNA 16S genes. 

[0083] On comparing the three strains tested it was established that the strain DS7 contains an average of 2-3 copies 
of the 130 kb plasmid per cell, whereas cells of Rhodococcus DS-2 transformed with pSM846 contain from 4 to 8 
copies of plasmid per cell. 

EXAMPLE 7 

Construction of the plasmid pSM847 

[0084] The objective of the experiment was to obtain a strain containing several copies of sox genes under the control 
of the constitutive promoter identified. 

[0085] The fragment of DNA containing the promoter identified in example 3, was amplified starting from the plasmid 
pSM839 by means of PCR, using the following oligonucleotides: 

- 5' ATTCGAGAGT GCATATGCGG AAC 3' (FORWARD) 



- 5' CCATTTCTTC CAAGCTTCCG CCG 3' (REVERSE) 

which pair with the sequence SEQ. ID. Nr. 2, into which the restriction sites Ndel and Hindlll were introduced. 
[0088] About 500 ng of the DNA obtained from the amplification reaction were digested with 4 U of the enzymes 
Ndel and Hindlll for 60 minutes at 37°C and purified on Nusieve agarose gel at 1.5% (FMC products). Parallelly, 1 ug 
of the plasmid pSM789 was digested with 4 U of the same enzymes for 60 minutes at 37° C. 50 ng of vector and 25 
ng of insert purified from agarose were then ligated in the presence of 1 U of the enzyme T4 llgase. The ligase mixture 
was subsequently used to transform competent cells of E.coli XL-blue. 

[0087] 500 ng of the plasmid extracted from one of the clones of E.coli containing the desired insert, and 500 ng of 
the plasmid pSM843 were digested separately with 4 U of the restriction enzyme Sspl (New England Biolabs). The 
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reactions were carried out at 37°C for 60 minutes and at 70°C for 10 minutes. 

[0088] 150 ng of the linear plasmid pSM843 were then ligated with 50 ng of the plasmid of E.coli in 10 uJ of buffer 
(Boehringer), in the presence of 1 U of T4 DNA ligase, incubating for 16 hours at 16°C. 1 uJ of the ligase mixture was 
used to transform electrocompetent cells of Rhodococcus DS-2. The cells were plated on minimum medium in the 
5 presence of DBT and cultivated at 37°C for 4-5 days. 

[0089] The plasmid DNA was extracted from one of the colonies thus obtained, according to the method described 
in Serbolisca et al. (1999, Appl. Env. Microbiol. 52:122-126) and subjected to restriction analysis with enzymes for 
which there are sites in the starting plasmids. The vector thus obtained, containing the sox operon under the control 
of the constitutive promoter, was called pSM847 (Figure 5) and the strain containing it SMV114. 

10 

EXAMPLE 8 

[0090] The strain SMV11 4 was cultivated in parallel with the strain Rhodococcus DS7 in minimum medium plus DBT 
with and without 0.20 g/l of MgS0 4 -7H 2 0. 
is [0091 ] Electrophoresis and Western blot analysis of the soluble proteins extracted from the bacterial cultures showed 
that: 

1. when the strains are grown in DBT without sulfate, SMV114 expresses higher quantities of Sox A, B and C 
proteins than DS7; 

20 2. in the presence of inorganic sulfate, the strain Rhodococcus DS7 does not express the Sox enzymes, whereas 

SMV114 expresses the same quantity of enzymes with respect to growth without sulfate. 

[0092] The desulfurizing activity of the strain SMV114 was measured on resting cells using DBT as substrate and 
compared with that of the strain Rhodococcus sp. DS7. From a pre-inoculum in minimum medium, two inocula were 
25 effected for each strain in 100 ml of the same medium, without and with inorganic sulfur (0.20 g/l of MgS0 4 -7H 2 0, 80 
mM). The growths were effected for 20 hours at 30°C in flasks with breakwater protection. The starting optical density, 
measured at 660 nm, was OD 0.25 for the cultures in MgS0 4 + DBT and 0.4 for the cultures with DBT alone. The dry 
weight was triply determined, from 5 ml of centrifuged culture at 5000-6000 rpm for 1 0 minutes, frozen at -80°C and 
lyophilized. 

30 [0093] For the measurement of the desulfurizing activity, 1 0 ml of culture were removed and centrifuged at 5000-6000 
rpm for 1 0 minutes, at room temperature. The cells were resuspended in 9.75 ml of TrisHCI 20 mM pH 7, introduced 
into 50 ml flasks and incubated in a stirred bath at 30°C for 5 minutes. After adding 250 ul of DBT 40 mM to the 
suspension, a first sampling (1 ml) was effected immediately, which was extracted with 2 ml of ethyl acetate, to deter- 
mine the product background at time zero. Subsequent samples were taken after 1 and 2 hours. 

35 [0094] 20 ul of the clarified organic phase were analyzed by HPLC on a chromatographic column in C1 8 Vydac type 
TP218-5418 inverse phase, with a program comprising a flow of 1 ml/min for 15 minutes in 80% of acetonitrile. The 
specific activity, expressed as mg of 2HBP produced per hour per gram of dry weight, was calculated from the area 
corresponding to the product 2-hydroxybiphenyl (2 HBP) and taking in account the dry weight. 

[0095] The activity values obtained without inorganic sulfur corresponded to 5 +/- 0.6 mg/h.g both for the native strain 
40 and for SMV114. When cultivated in the presence of 0.2 g/1 of MgS0 4 , the Rhodococcus DS7 strain did not express 
any desulfurizing activity, whereas the SMV114 strain maintained its activity. 

EXAMPLE 9 

45 [0096] The possibility of using the plasmid pSM847 in other species of Rhodococcus was also examined. 1 00 ng of 
the plasmid were used to transform electrocompetent cells of strains of Rhodococcus erythropolis , Rhodococcus rho- 
dochrous and Rhodococcus opacus . In all cases colonies capable of desulfurizing DBT were obtained, which contained 
the plasmid having the estimated dimensions. Also in this case, at least 90% of the clones maintained the plasmid for 
about 40 generations in the absence of selective pressure. 

so 
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SEQUENCE LISTING 
5 SEQ ID NO: 1 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 22 90 base pairs 

w 

STRANDEDNESS : Single 
TOPOLOGY : Linear 

ATCGATTTTT CCGCCGGCGG CGCCGCCGCA TCCCACGCTG CCGTGAGTCC 50 
GTCCAAAGCT CGCAGCATCG ACGGGTGGTA CTTACGCAAC GCTGCCTGGC 100 
20 TGAACCCGGC GTTGAGGATG ATGTGCGCCT TCTGCCAGTT GGGTTCGGAG 150 

TTGAAAGCGG TGAAAAGGCC GTCCTCCGCG ACTGCCCGCA GCTTTCGGAT 2 00 
CGGCAGACCC ACCGATTTCG CCCACAATGC CTCGTTGTTG ACCTCGGCGA 250 

25 

CCAGCTCCAC CCCGGAGACC ACGGTCAGTC GGTGATTGAT GATCTTGCGC 300 
TCGAAGATCG GCCCCAGCCT CGAAGCCATG ACCATTTCCT TCTGGACAGG 350 

30 

CTTCACCGAG TCGACGCTGA GCAAGTCC CC CAACACAGGC AGCCGCCAGG 4 00 
ACGGCGACGG AATCGACTCG AACTCAAGGA CTGGTTTGAC CGCGACCTCG 450 
35 CTGGTGGGTG TCTCACTGCT CGGATACGGG CACTTGTCGG AAGACATGGT 500 

GGACAGTCTT GCATCAGTTC CGGACTAGAT GCCGACTCGA CGCAAAATGG 55 0 

40 
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GCGATGGAAC GCAAAGCCGC CATCCAGGAT GGAGCAGCGC GCCAAGAACC 6 00 
5 ATGCAAACCA GCG CTACTTG CGCAGACAAG CGGAAGTCGA ACGTGTCGTT 650 

GTTCCCCCTC ATCTTCACTT CCCCACTGCT CTACGGAACC TTCTGAGCTG 700 
io GCATTTTGCA TGATTTCCGA GCCCCGGAAC CAGCTATACA GTTTTACGGC 750 

AAACCGGGAA ACTGGAAAAC CGTATAGACG GCTACCCGGC AAAACGATAT 800 
GCTGTCTATA CGGATAGACG GGAAACCGGA TGCCTGGGAA ACTGGCTAAG 850 
TGTGTATACG GCTAACTGGC AAGCCGGGAA GCTGTTTTAC AGGTAACTGT 900 
TTTTACAGCT TTCCGGCAAA CCGGGAAACT GTAAAACCGG CAAACTGGGA 950 

ACTGTAAAAC CGGCAAACTG TAAACAAGCG AAAGGGCCCT T CTCGATGAT 1000 

CATCAGTCTC GTCAACACCA AAGGCGGAAC GGCCAAGACG ACGAGCGCGA 1050 

25 TCTAC CTTGC ACTCGCGTTT CATAATCGGG GGAGGAAGGT TGTCGTCCTC 1100 

GACTTGGATA AACAGGGTTC AGCAACTGAC TGGGCTGACC GCGCCACAGA 1150 

GGC CGGAGAT CCACTCCCGT TCCCAGTGCA TGTCGTGAAC ATGAAACGGC 1200 

TGGTGAAGTA CGCCACCGAT GGCGATGACC AGGTAGTAAT CATCGACACC 1250 

CCGCCCGGTG ACGGGCAAGT TATCGACGCT GCAATCGGGG CGTCCAACTT 1300 

CGTCATAATG CCCACGGCGG CTACAGGACT CGACACCGCC AGGGTCTGGG 13 50 

AGACGCTGCC CTCGGTGCAG GGCCGCCTTC CCTACGGAAT CCTCATTACC 1400 

TCCGCACGTC TCGGAACGAA CCTGCTTGAA GATGCCAAGG CAGCGTTCGA 1450 

CAGCAACGAC GCAGCCCGAT TCGACACCGT CATCCCCATG CGCGAGCGCA 1500 

TCCGGTCAAC ATTCGGTACT ACCCCCAAGC ACGACGAAGG GTATTCCGAC 1550 

GTCGTCGACG AGAT CACAGA GGCGCTGACA G CATGAG CAT CCCCAAGGCA 1600 

AAGACGACAC CGACGCTCGG GCCCCGCAAG TCCGCGCCAC CGATCGCGAC 1650 

GCCCTCAGTG ACGTCCAGCT TCGTCGAACC GGACGCAGAC CGCACGAAAC 1700 

TCACCGTCCA GATCGACGCT GAACTGCACC GACGCTTCAA AGCTGCTGTT 1750 

GCAGGAAGCG GCAAGAAGAT GCGCGACGTG GTCGAGGAAA TGATCGAGCA 1800 
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GTGGACCGAC GCGAACAGTG GTCGCTCATA ACTCGGTGAC GCTGTTACGC 1850 

5 

AAAACCAAAG GGAGCCTCGC CGGTTGCACC CGACGAGGCT CCAGACCGGG 1900 

AAACCCCTGG AAGGAAGAAA CCCAGTCATG ACCGACTTTA GCCTTCGGGC 1950 

10 AGCT CGTATT CAGCTAGGCG CATTGCTCGC CGCACTGGCA CTCGCGATCG 2000 

TCATTGCGCT CGGTATCACC ACCGGCGCAT TCGTACTTTC CTTCGCCGTG 2050 

CAACGCGACC TTGCACGGCA AGCACTGATC CCCGAACACC TGACCTGGAT 2100 

CTTCCCGGCG ATTGT CGACA GCGCTATCCT CGGCGCCACG ATCGCCATCG 2150 

TCATCATCAG CAAGCTCAAC ATGAACAAGC GCGACAGAGG CTT CT ACATC 2200 

GCACTCGCCG TCAGCGTTGT CGTGATCAGC ATCCTCGGAA ACGCGTACCA 2250 

CGCCTATCAC GCAGCAATCG CCGCGCAGGA GTCGATCGAT 2290 
SEQ ID NO: 2 



15 



20 



25 



30 



35 



40 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 1355 base pairs 
STRANDEDNESS : Single 
TOPOLOGY: Linear 
PROPERTIES : Promoter 

AT CGATCGGC CGGGATCGAC GAGTGTTGCC ATTTCACCGA GCACTTCGCC 50 
ACCGCGGATC TGACGGCGCC GATCTCGGTG ATGGAGTC CT GGTCGGCGCT 100 
TCCACCTGTG GTCTCGAGGT CGACGACGAC AAACGTCGTT TCGCTCAGCG 150 
as GTGTGTCCAG CTCGTCGAAG CTCAGTTGCC GCCCGACGTC GAACTTCTCC 200 

CCGTCGGAGA ACGGGGACGG AGACGGAAAT TCTCGGGCCG AGGCGGGACT 250 
CACACCGCAG ACAGTAGAAG GACCCCCCGA CACGAATTCG AGAGTGCATA 300 
CGCGGAACCC GGCTCACTCC GACTTCCATT GGGAAAGCAA CGGATTGACC 350 
CCCGACAAAA GTACGAGAGA CGAGGATCAC ACTTGCGGTG GGAACTCGCG 400 
AATG CGGACT CCTACCTACT TACCGGTAGG GCGATTCGTG AGGGTGTTGC 450 
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TACGGCGATC AGTCGCCGAA ACCGCCTGAT TGATCGAAAC GGGTTCTAGT 500 



TTCAAGTAAT CGGCGTTGAA CTGCATGGAA GATCGGATTG TCGGGTTTGG 550 

TGACCCTTGC CAGTGGGTAG CAAGCGCAGG TTTGCGGGCC AGGTGAGCCT 600 

CGCCGAACAG ACCTGTTGTT ATCTCTTCGT GACCCGTCCG GCGTATCGCG 650 

GGCGCGGCAG CATGATTTCC GGCGAATCGG GCGCGATTGT CACGAAGCGG 700 

TCGACATCGT CGCGATCATT T CGTAAC CTT CCTGAGACCT AACGAAGTCT 750 

TGCCGGACCA AACAGGACCG GCATCGTCAG GAACCGCCTA ATCGGGATCT 800 

CCGCGAGGAA CCCGAACCGG GAACCCAAGT TTCCACTGGG GTGAATC CCG 850 



CCTGGGGTGC AGCACGCATG ATCGCTGCGG ACCACGCGGG TAGGGCTGAT 900 

CTTCCCAGCC CGAACCCGTC AGCTAACTCG GTCGGCGGAT GAATGGAAGA 950 

AATGGAGCAC CCCTTAAGTG GCGTCACAGA GTTTCAAGCG TTCAGCGCAG 1000 

TTGGCAGTAG CCGGCGCGCT CGCAGTCGGA GCATTTGCTG CAACCGCTGC 1050 

ACCTGCCTCG GCAGACCCCA TCACCATCCC CGGCGTCGGC ACCTTCGAGG 1100 

TTCCGGGCGC TTCTATTCCT CAGTTGCCGG TCATCCCGGG CATCACCGAC 1150 

ATCGCACCGG CCGCTCCGGC CGCTCCCATC TCCAGTGTTG GTGAGCAGGC 12 00 



AGTTCGCGCC GCAGAGAGCA AGCTGGGCTC CCCGTACGTG TACGGCGCAT 1250 
CGGGCCCGGA CGCATTCGAC TGCTCCGGTC TGGTC CAGTG GGCATACAAG 1300 



CAGGCTGGTC TGAACCTGCC TCGCACGAGC TACGACCAGG CCGCAGCCTA 1350 
AATTT 



5 



20 



35 



45 



1355 



SBQ ID NO: 



3 



SEQUENCE TYPE : Nucleotide 



50 



SEQUENCE LENGTH: 28 06 base pairs 



STRANDEDNESS : Single 



55 



TOPOLOGY: Linear 
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PROPERTIES : ope r on 

AGATGC CACT GCGCCATTCC TGGTGAGGTA CGTCGGCCGA CCACCGTTCG 50 
GCTGGCGTAC CACATCCACT TTCAAGTCCT GCACATCCGC ACCTTTTTTA 100 
AATCGCCAAC TATCGCTATC AT CGTTATAT GACGATAAAT AAAGAGTCGG 150 
GGTGCCTCGC CGCAACCGGC AGCGGGTCCA GCTTGGATGC GGCAGTGGCC 200 
CTGTTCCACA GTCTCTCGGA CGGCACCCGA TTGTCGATCG TGCGCCGTCT 250 
CGCCGAAGGA GAGGCACGGG TTGCGGATTT GATCGGCGAA CTCGGCCTCG 300 
CCCAGTCGAC GGTGTCCGCA CATGTCGCAT GCCTACGTGA TTGCGGTTTG 35 0 
GTCGACGGCA G AC CGGAAGG TCGGCAGGTG TTCTACTCGC TGGCCCGGCC 400 
GGAATTGATG GAC CTACTCG CCTCGGCGGA GACGCTCCTT GCCGCGACAG 4 50 
GGAACGCGGT TGCCCTGTGC CCGAATTACG GCACCGACAT CGGAGATAGC 5 00 
CGTGAGTGAC GCGTGCGGCT GCGGCCACGA CGAACCCCGT GCCGAGGGCG 550 
AGGAAGAACA CGGGCCCGAA AAGTGGTGGC AGGTTACCGA GATC CGGGCG 60 0 
GCTGCAGCTG CGGGCGTGCT GCTGATCGCG GCCCTGACAG TCGGGCTGGC 65 0 
CGGCGGACCT GATGCGCTCG GGATCGGCCT CGAAGCGGGC GCGCTGATCA 70 0 
TTGCCGGCTA CACCTTCGTA CCGTCCACCC TCACACGCCT GGCGAAGGGC 7 50 
AAAATCGGGG TCGGCACCCT GATGACGATC GCGGCCGTCG GCGCGGTACT 800 
GCTCGGCGAG GTCGGCGAAG CAGCCATGCT CGCATTCTTG TTTGCGATCA 850 
GCGAGGGACT CGAGGAGTAC GCGGTCACCC GCACCCGCCG TGGCCTGCGC 900 
GCGTTACTGT CCCTGGTCCC GGACACCGCG ACAGTGCTCC GCGACGGCCG 950 
GGAGGAAACC GTCCCACCCT CGGACCTCGA GCTCGGTGAG GCCATGATCG 1000 
TCAAGCCCGG GGAGCGGATC GCCACCGACG GTGTCATTCG CGCCGGCCGC 1050 
ACCGCCCTGG ACACCTCCGC GATCACCGGA GAATCGGTGC CCGT CGAAGC 1100 
CGGTCCCGGC GACGAGGTAT TCGCCGGGTC GATCAACGGC ACCGGCGTCC 1150 
TCGAAGTCGA GGTCACCGCA GAGGCGCAGG ACAATTCGCT GGCGAAGATC 1200 
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GTGCGGATCG TCGAAGCTGA GCAGTCCCGC AAGGGCGAGG CGCAACGCCT 12 50 

CGCCGACAAG ATCGCCAAAC CGCTGGTTCC GGGCGTAATG ATCGTCGCCG 13 00 

CGGTTATCGC CGCGGTCGGC AGCGTGCTCG GTGATCCGCT GGTGTGGATC 1350 

10 GAACGAGCCC TGGTGGTACT GGTCGCCGCG TCCCCGTGCG CGCTGGC CAT 1400 

CGCGATACCC GTCACGGTCG TCGCCGCAAT CGGCGCAGCA TCCAAGCTCG 14 50 

GTGTCCTGGT CAAGGGCGGC GCAGCCCTCG AAGCGCTCGG CAGGATC CGC 1500 

ACCATCGCAT TGGACAAGAC CGGCACCCTC ACCCGCAACC AACCCGCCGT 1550 

GATCGACGTC GCGACCGCGA ACGGCGCCAA C CGCGGTGAC GTTCT CGCGG 1600 

TGGCCGCAGC GTTGGAGGCA CGCAGCGAGC ACCCGCTCGC TCGCGCGATT 1650 

CTCGCCGCCG TCGAGGACTA CACACCCGCT GATGACGCGG ACGCAGTCAT 1700 

25 CGGTGCAGGC CTGACCGGCT ACATCGACGG CATCCCCGTC CGGCTCGGTC 1750 

GACCAGGCTG GATCGACCCG GGGCCACTGA CCGGGGATAT CGAACGAATG 18 00 

30 CAGCACGCCG GCGCCACTGC GGTCCTGATC GAACGCGCCG GAACCGTGAT 1850 

CGGGGCGGTC GCCGTCCGCG ACGAACTCCG CCCGGAAGCA CGCGAAGTGG 1900 

TCACCGAACT ACGCCGCGGC GGCTACACCG TCGCGATGCT CACCGGCGAC 1950 

AACGAGCGCA CTGCCCGCGC TCTGGCCGCC GATGTCGGCA TCGACGACGT 2000 

GCACGCCGAT CTGCGCCCGG AGGACAAGGC GCGCATTGTG GAGAC CTTGC 2050 

GTGCGAGCCG ACCGACGGCG ATGGTCGGGG ACGGTGTCAA CGATGCCCCC 2100 

GCCCTGGCCA CCGCCGATCT GGGTATCGCG ATGGGTGCGA TGGGAACCGA 2150 

as CGTCGCCATC GAGAC CGCCG ATGT CGCTTT GATGGGCGAG GACCTGCGGC 2200 

ATCTTCCTCA GGCTCTGCAG CACGCGCGAC GATCGCGGTC GATCATGTTG 2250 

CAGAACGTCG GCCTGT CTTT GGCGATCATC ACCGTTCTGA TGCCTCTGGC 2300 

TCTGTTCGGT GTACTCGGAT TGGCTQCGGT CGTGCTCGTC CACGAGGTCG 2350 

CCGAGATCGT CGTCATCGCC AACGGGGTGC GCGCCGGGCG TACCCGAAGT 2400 

CTGGCTGCTG TACCGCAGTC GACCAGGGCG TCCGAGCCGG CCACGGTAGT 2450 
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ACGGCGATGA CATCTGCCGC CAGCTCCGGG TCATCAT CGG TGTCCGTGGT 2500 
CGGAATGCGT CGCGGCCGGG GTGTTCTCGT CATACTCGCA GGTGCCTTGG 2550 
CTGCGGTCGC GCTGCTCCTC GACCCGGTGG CACGTGGCGC GCTGTCCGGT 2600 
GGGCAGGTAG TCGATTTCGG GGTCTTGCAA CTGCGATTGG CCTACAACAG 2650 
CGGTGTCGCG TTCAGTCTCG GTGACCAGCT CCCCACAGTC GTCGTTCTCG 2700 
CCGGCACCGC TGCCCTCACC GCCGCGATCG GTGTGTTCGC CTGGCGCACA 2750 
G CAT CTGAGC GTCCCGTACT CCAGACCATC GGGCTGGCCG CGATCACGGC 2 800 
CGGTACC 28 06 

SEQ ID NO: 4 

SEQUENCE TYPE: Nucleotide 
25 SEQUENCE LENGTH: 22 base pairs 

STRANDEDNESS : Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 
CAGTCACGAC GTTGTAAAAC GA 
SEQ ID NO: 5 
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35 



40 



SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 2 0 base pairs 
STRANDEDNESS : Single 
45 TOPOLOGY: Linear 

PROPERTIES : Primer 
TGCATTTGTC GTTGTTGAGT 

so 

SEQ ID NO: 6 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
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STRANDEDNESS : Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 
GATCGGCCGGG ATCCACGAGT GTT 
SEQ ID NO: 7 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH : 58 base pairs 
STRANDEDNESS : Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 

ACCCAACACT GAGCTGTTAA CGGCCGGAGC GGCCGATGCA 
TTTTAGGTGA TGCCCGGG 
SEQ ID NO; 8 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
STRANDEDNESS: Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 
TGGCCCGGCC GGAATTGATG GAC 
SEQ ID NO: 9 

SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 24 base pairs 
STRANDEDNESS : Single 
TOPOLOGY : Linear 
PROPERTIES : Primer 
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GCCGACGGCC GCGATCGTGA TCAG 
SEQ ID NO: 10 
SEQUENCE TYPE: Nucleotide 
SEQUENCE LENGTH: 23 base pairs 
STRANDEDNESS : Single 
TOPOLOGY: Linear 
PROPERTIES : Primer 
ATTCGAGAGT GCATATGCGG AAC 
SEQ ID NO: 11 
SEQUENCE TYPE: Nucleotide 
25 SEQUENCE LENGTH: 23 base pairs 

STRANDEDNESS; Single 
30 TOPOLOGY : Linear 

PROPERTIES: Primer 
CCATTTCTTC CAAGCTTCCG CCG 

35 



Claims 

40 

1 . A cloning vector pSM843 which comprises: 

(a) the rep genes, ORF81 and trbA which encode proteins involved in replication in Rhodococcus; 

(b) the gene parA having the sequence SEQ. ID. Nr. 1; and 

45 (c) at least one gene which encodes a genetic marker selected from the genes of the cad operon, which confer 

resistance to cadmium, or the genes which encode resistance to an antibiotic. 

2. The vector according to claim 1, deposited with the number CBS 102446. 
so 3. An expression vector which comprises: 

(a) the rep genes, ORF81 and trbA which encode proteins involved in replication in Rhodococcus; 

(b) the gene parA having the sequence SEQ. ID. Nr. 1; 

(c) a constitutive promoter of Rhodococcus having the sequence SEQ. ID. Nr. 2; 
55 (d) a multiple cloning site downstreamr the promoter; 

(e) at least one gene which encodes a genetic marker selected from the genes of the cad operon, which confer 
resistance to cadmium, or the genes which encode resistance to an antibiotic; and 
(0 contains the replication origin in E.coli , deposited with the number CBS 1 02445. 
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4. The vector according to claim 3, which comprises downstream the constitutive promoter one or more genes which 
encode a protein of interest. 

5. The expression vector according to claim 4, wherein the proteins are selected from enzymes involved in the se- 
lective removal of organic sulfur from fossil fuels or enzymes involved in the production of L-aminoacids, enanti- 
omorphs of chiral compounds and carboxylic acids. 

6. The expression vector pSM847 according to claim 4, which comprises downstream the constitutive promoter the 
sox operon which encodes SoxA, SoxB and SoxC enzymes deposited with the number CBS 102447. 

7. The expression vector according to claim 3, obtained by: 



(1 ) construction of the cloning plasmld vector pSM843; 

(2) isolation of a constitutive promoter of Rhodococcus ; and 

J 5 (3) insertion of said constitutive promoter in the vector pSM843. 

8. A microorganism transformed with the expression vector according to claims 3 to 6, wherein said microorganism 
is selected from Rhodococcus , Gordona and Nocardia . 

20 9. a strain of Rhodococcus transformed with the expression vector pSM847 deposited with the number CBS 1 02447. 

10. A process for the production of homologous or heterologous proteins of interest which comprises cultivating, under 
suitable conditions, a microorganism transformed with the expression vector according to claims 3 to 6. 

25 ii. The process according to claim 1 0, wherein the protein is selected from enzymes involved in the selective removal 
of organic sulfur from fossil fuels and in the production of L-aminoacids, enantiomorphs of chiral compounds and 
carboxylic acids. 

12. The process according to claim 11 , wherein the proteins are Sox enzymes and the strain is Rhodococcus SMV1 14 
30 CBS 102447, 

13. A process for the removal of organic sulfur from fossil fuels characterized in that it uses a microorganism selected 
from Rhodococcus , Gordona and Nocardia transformed with the expression vector CBS 102445 containing the 
sox operon downstream the constitutive promoter. 



14. The process according to claim 13, wherein the microorganism is Rhodococcus SMV114 CBS 102447. 



15. A research method for promoters in microorganisms capable of integrating at random fragments of foreign DNA 
in its chromosome without requiring a sequence homology higher than 3 bp between the donor DNA and that of 
to the host, which consists in: 

(i) transforming said microorganism directly with a gene reporter without its promoter or with a multicopy plas- 
mld of E.coli containing said gene and linearized upstream the gene reporter; 

(ii) selecting the clones which express said gene, i.e. the clones which have integrated the gene reporter in 
45 their chromosome, downstream a promoter sequence; 

(iii) digesting the chromosomal DNA of the clones selected with restriction enzymes which cut upstream and 
downstream the gene; 

(iv) amplifying the DNA obtained in step (iii); and 

(v) sequencing the promoter upstream the gene reporter. 



16. The method according to claim 15, wherein the microorganism is a strain of Rhodococcus . 

17. The method according to claim 15, wherein the gene reporter is selected from those which encode resistance to 
antibiotics, heavy metals or enzymes such as XylE or Sox proteins. 

18. A constitutive promoter of Rhodococcus characterized by the sequence SEQ. ID. Nr. 2. 
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